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Description 



This invention relates to recombinant DNA 
techno! gy. It particulariy relates to the applica- 
tion of the technology to the production of human s 
serum albumin (HSA) in microorganisms for use 
in the therapeutic treatment of humans. In one 
aspect the invention relates to a technique for 
producing DNA sequences encoding desired 
polypeptides. In another aspect it relates to the w 
construction of microbial expression vehicles 
containing DNA sequences encoding a protein, 
e.g. human serum albumin or the biologically 
active component thereof operably linked to 
expression effecting promoter systems and to the is 
expression vehicles so constructed. In another 
aspect, the present invention relates to microor- 
ganisms transformed with such expression 
vehicles, thus directed in the expression of the 
DNA sequences referred to above. In yet other 20 
aspects, this invention relates to the means and 
methods of converting the end product of such 
expression to entities, such as pharmaceutical 
compositions, useful for the therapeutic treat- 
ment of humans. In preferred embodiments, this 25 
invention provides for particular expression vec- 
tors that are sequenced properly such that mature 
human serum albumin is produced directly. 

in one aspect, the present invention is particu- 
larly directed to a method of preparing cDNA 30 
encoding polypeptides or biologically active por- 
tions thereof. This aspect provides the means and 
methods of utilizing synthetic primer DNA corre- 
sponding to a portion of the mRNA of the 
intended polypeptide, adjacent to a known 35 
endonuclease restriction site, in order to obtain 
by reverse transcription a series of DNA frag- 
ments encoding sequences of the polypeptide. 
These fragments are prepared such that the entire 
desired protein coding sequence is represented, 40 
the individual fragments containing overlapping 
DNA sequences harboring common endonuc- 
lease restriction sites within the corresponding 
overlapping sequence. This aspect facilitates the 
selective cleavage and ligation of the respective 45 
fragments so as to assemble the entire cDNA 
sequence encoding the polypeptide in proper 
reading frame. This discovery permits the obten- 
tion of cDNA of high molecular weight proteins 
which otherwise may not be available through so 
use of usual reverse transcriptase methods and/or 
chemical synthesis. 

The publications and other materials hereof 
used to illuminate the background of the inven- 
tion, and in particular cases, to provide additional 55 
details respecting its practice are incorporated 
herein by reference, and for convenience, are 
numerically referenced in the following text and 
grouped in the appended bibliography. 

* 60 

(A) Human serum albumin 

Human erum albumin (HSA) Is the major pro- 
tein species In adult serum. It is pr duced in th 
liver and is largely responsible for maintaining 
normal osmolality in the bloodstream and func- es 



tions as a carrier for num rous serum m lecules 
(1, 2). The apparent fetal counterpart of HSA is a- 
fet protein and studies have been undertaken to 
compare th two as well as rat serum albumin 
and a-fetoprotein (3-8). The complete protein 
sequence of HSA has been published (9—12). The 
published protein sequences of HSA disagree in 
about 20 residues as well as in the total number of 
amino acids in the mature protein [584 amino 
acids (9); 585 (12)]. Some evidence suggests that 
HSA is initially synthesized as a precursor 
molecule (13, 14) containing a "prepro" 
sequence. The precursor forms of bovine (15) and 
rat (16) serum albumin have also been 
sequenced. 

The role or rationale for the use of albumin in 
therapeutic application is for the treatment of 
hypovolemia, hypoproteinemia and shock. Albu- 
min currently is used to improve the plasma 
oncotic (colloid osmotic) pressure, caused by 
solutes (colloids) which are not able to pass 
through capillary pores. Inasmuch as albumin has 
a low permeability constant, it essentially confi- 
nes itself to the intravascular compartment When 
different concentrations of nondiffusable particles 
exist on opposite sides of the cell membrane, 
water crosses the partition until the concen- 
trations of particles are equal on both sides. In 
this process of osmosis, albumin plays a vital role 
in maintaining the liquid content in blood. 

Thus, the therapeutic benefits of albumin 
administration reside primarily for the treatment 
of conditions where there is a loss of liquid from 
the intravascular compartment such as in surgi- 
cal operations, shock, burns, and hypop- 
roteinemia resulting in edema. Albumin is also 
used for diagnostic applications in which its 
nonspecific ability to bind to other proteins makes 
it useful in various diagnostic solutions. 

Presently, human serum albumin is produced 
from whole blood fractionation techniques, and 
thus is not available in large amounts at competi- 
tive costs. The application of recombinant DNA 
technology makes possible the production of 
copious amounts of human serum albumin by 
use of genetically directing microorganisms to 
produce it efficiently. The present invention may 
enable the availability of purified HSA produced 
through recombinant DNA technology more 
abundantly and at lower cost than is now 
presently possible. The present invention also 
provides knowledge of the DNA sequence 
organization of human serum albumin and its 
deduced amino acid sequence, helping to eluci- 
date the evolutionary, regulatory, and functional 
properties of human serum albumin as well as its 
related proteins such as alpha-fetoprotein. 

More particulariy, present invention provides 
for the isolation of cDNA clones spanning the 
entire sequence of the protein coding and 3' 
untranslated portions of HSA mRNA. These cDNA 
clones were used to construct a recombinant 
express! n vehicle which directed the expression 
in a micr organism strain of the mature HSA 
protein under control f the trp promoter. The 
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present Invention als provides the com p let 
nucleotide and deduced amino acid sequence of 
HSA. 

Reference herein to the expression of "mature 
human serum albumin" connotes the microbial 
production of human serum albumin unaccom- 
panied by the presequence ("prepro") that imme- 
diately attends translation of the human serum 
albumin mRNA. Mature human serum albumin, 
according to the present invention, is imme- 
diately expressed from a translation start signal 
(ATG), which also encodes the amino acid 
methionine linked to the first amino acid of 
albumin. This methionine amino acid can be 
naturally cleaved by the microorganism so as to 
prepare the human serum albumin directly. 
Mature human serum albumin can be expressed 
together with a conjugated protein other man the 
conventional leader, the conjugate being speci- 
fically cteavable in an intra- or extracellular 
environment See British patent publication 
number 2007676A. Finally, the mature human 
serum albumin can be produced in conjunction 
with a microbial signal polypeptide which trans- 
ports the conjugate to the cell wall, where the 
signal is processed away and the mature human 
serum albumin secreted. 

(B) Recombinant DNA technology 

With the advent of recombinant DNA tech- 
nology, the controlled microbial production of an 
enormous variety of useful polypeptides has 
become possible. Many mammalian polypep- 
tides, such as human growth hormone and 
human and hybrid leukocyte interferons, have 
already been produced by various microorgan- 
isms. The power of the technology admits the 
microbial production of an enormous variety of 
useful polypeptides, putting within reach the 
microbially directed manufacture of hormones, 
enzymes, antibodies, and vaccines useful for a 
variety of drug-targeting applications. 

A basic element of recombinant DNA tech- 
nology is the plasmid, an extrachromosomal loop 
of double-stranded DNA found in bacteria often- 
times in multiple copies per cell. Included in the 
information encoded in the plasmid DNA is that 
required to reproduce the plasmid in daughter 
cells (i.e., a "replicon" or origin of replication) and 
ordinarily, one or more phenotypic selection 
characteristics, such as resistance to antibiotics, 
which permit clones of the host cell containing 
the plasmid of interest to be recognized and 
preferentially grown in selective media. The utility 
of bacterial plasmids lies In the fact that they can 
be specifically cleaved by one or another restric- 
tion endonuclease or "restriction enzyme", each 
of which recognizes a different site on the plasmid 
DNA. Thereafter heterologous genes or gene 
fragments may be inserted into the plasmid by 
endwise j ining at the cleavage site or at recon- 
structed ends adjacent to the cl avage site. (As 
used herein, the term "heterologous" refers t a 
gen not rdinarily found in, or a polypeptide 
sequence ordinarily not produced by, a given 



micr rganism, whereas the term "homologous" 
refers to a gene r polypeptide which is found in, 
or produced by the corresponding wild-type 
microorganism.) Thus formed are so-called 
5 repiicable expression vehicles. 

DNA recombination is performed outside the 
microorganism, and the resulting "recombinant" 
repiicable expression vehicle, or plasmid, can be 
introduced into microorganisms by a process 

10 known as transformation and large quantities of 
the heterologous gene-containing recombinant 
vehicle obtained by growing the transformant. 
Moreover, where the gene is properly inserted 
with reference to portions of the plasmid which 

is govern the transcription and translation of the 
encoded DNA message, the resulting expression 
vehicle can be used to actually produce the 
polypeptide sequence for which the inserted gene 
codes, a process referred to as expression. 

20 Expression is initiated in a DNA region known 
as the promoter. In the transcription phase of 
expression, the DNA unwinds, exposing the 
sense coding strand of the DNA as a template for 
initiated synthesis of messenger RNA from the 5' 

25 to 3' end of the entire DNA sequence. The 
messenger RNA Is, in turn, bound by ribosomes, 
where the messenger RNA is translated into a 
polypeptide chain having the amino acid 
sequence for which the DNA codes. Each amino 

30 acid is encoded by a nucleotide triplet or "codon" 
which collectively make up the "structural gene", 
i.e., that part of the DNA sequence which encodes 
the amino acid sequence of the expressed poly- 
peptide product 

35 Translation is initiated at a "start" signal (ordi- 
narily ATG, which in the resulting messenger 
RNA becomes AUG). So-called stop codons, 
transcribed at the end of the structural gene, 
signal the end of translation and, hence, the 

40 production of further amino acid units. The result- 
ing product may be obtained by lysing the host 
cell and recovering the product by appropriate 
purification from other proteins. 
In practice, the use of recombinant DNA tech- 

45 nology can express entirely heterologous poly- 
peptides— so-called direct expression — or 
alternatively may express a heterologous poly- 
peptide, fused to a portion of the amino acid 
sequence of a homologous polypeptide. In the 

50 latter cases, the intended bioactive product is 
rendered bioinactive within the fused, homolog- 
ous/heterologous polypeptide until h is cleaved in 
an extracellular environment See Wetzel, 
American Scientist 68, 664 (1980). 

55 If recombinant DNA technology is to fully sus- 
tain its promise, systems must be devised which 
optimize expression of gene inserts, so that the 
intended polypeptide products can be made avail- 
able in controlled environments and in high 

60 yields. 

(C) State of the art 

Sargent eta/.. In Proc Natf. Acad. Set, (USA) 78, 
243 (1981), describe the cloning of rat serum 
$5 albumin messenger RNA in a series f recombin- 
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ant DNA plasmids. This was done to determine 
the nucleotide sequences of the clones in order to 
study the evolutionary hypothesis of the protein 
product. Thus, these workers made no attempt to 
assemble the cDNA fragments they prepared. 5 

In Journal of Supramolecular Structure and 
M ular Biochemistry. Supplement 5, 1981, Alan 
R. Liss, Inc. NY, Dugaiczyk at at. report in abstract 
form, their studies of the human gene for human 
serum albumin. They obtained cONA fragments 10 
Out there is no evidence that these workers cloned 
or produced by fragments for any purpose other 
than for studying the basic molecular biology of 
the a-fetoprotein and serum albumin genes. 

The present invention is based upon the discov- ;s 
ery that recombinant DNA technology can be 
used to successfully and efficiently produce 
human serum albumin in direct form. The product 
is suitable for use in therapeutic treatment of 
human beings in need of supplementation of 20 
a bumin. The product is produced by genetically 
directed microorganisms and thus the potential 
exists to prepare and isolate HSA in a more 
efficient manner that is presently possible by 
blood fractionation techniques. It is noteworthy 25 
that we have succeeded in genetically directing a 

iT , ^2S < S wm t0 produce a P rotein °f enormous 
^^T 584 8m,no acids corresponding to an 
mRNA transcript upwards of about 2,000 bases 

The present invention comprises the human 30 
serum albumin thus produced and the means and 
methods of its production. The present invention 
is further directed to replicable DNA expression 
vehicles harboring gene sequences encoding 
HSA in directly expressible form. Further, the 35 
present invention is directed to microorganism 
strains transformed with the expression vehicles 
described above and to microbial cultures of such 
transformed strains, capable of producing HSA. In 
still further aspects, the present invention is 40 
directed to various processes useful for preparing 
said HSA gene sequences, DNA expression 
vehicles, microorganism strains and cultures and 
to specific embodiments thereof. Still further, this 
invention is directed to the preparation of cDNA 45 
sequences encoding polypeptides which are 
heterologous to the microorganism host, such as 
human serum albumin, utilizing synthetic DNA 
primer sequences corresponding in sequence to 
regions adjacent to known restriction endonuc- so 
n\ S .A S,tes ' 8uch tnat '"dividual fragments of 
cDNA can be prepared which overlap in the 
regions encoding the common restriction 
endonuclease sites. This embodiment enables the 
precise cleavage and ligation of the fragments so 55 
as to prepare the properly encoded DNA 
sequence for the intended polypeptide. 

The work described herein involved the 
expression of human serum albumin (HSA) as a 
representative polypeptide which is heterologous bo 
to the microorganism employed as host Likewise 
the w rk descnbed inv Ived us f the microor- 
ganism £ coli K-12 strain 294 (end A, thr, hsr", 

Wo m J'JL d o!f Cr ^ d fn British Patent ^"cation 
No. 2055382A. Th.s strain has been dep sited as 



with the American Type Culture Collection, ATCC 
Accession No. 31446. 

The invention, in its most preferred 'embodi- 
ments, is described with reference to £ coli 
including not only strain £ coli K-12 strain 294,' 
defined above, but also other known £ coli 
strains such as £ co//B,£ co//xl776and£ co//w 
3110, or other microbial strains many of which are 
deposited and (potentially) available from recog- 
a m Iu r °° A r9anf . Sm de P° sit o^Y institutions, 
* A ^2? n Type Cu,ture Collection 
^TCC)-cf. the ATCC catalogue listing. Seel also 
German Offenlegungsschrift 2644432. These 
other microorganisms include, for example, 
Bacilli such as Bacillus subtilis and other 
enterobacteriaceae among which can be men- 
tioned as examples Salmonella typhimurium and 
Serratia marcesans, utilizing plasmids that can 
replicate and express heterologous gene 
sequences therein. Yeast, such as Saccharomyces 
cerevisiae, may also be employed to advantage 
as host organism in the preparation of the inser- 
tion proteins hereof by expression of genes cod- 
ing therefor under the control of a yeast pro- 
moter. (See the copending U.S. patent application 
of Hiteeman at ah, filed February 25, 1981 (Attor- 
ney Docket No. 100/43), assignee Genentech, Inc. 
mmZlf co u rras P ondin 9 European Application 
82300949.3 which are incorporated herein by 
reference. 7 

Preferred embodiments of the invention will 
now be described with reference to the accom- 
panying drawings in which: 

Figs. 1 A and B are diagrams for use in 
explaining the construction of plasmid pHSA1; 

Fig. 2 shows the immunoprecipitation of bacte- 
nally synthesised HSA; and 

Fig. 3 shows the amino acid sequence of HSA 
and the corresponding DNA sequence. 

In fig. 1A, the top line represents the mRNA 
coding for the human serum albumin protein and 
c J? W Jl the regions cont ained in the cDNA clones 
M7, F-61 and B-44 described further herein. The 
initial and final amino acid codons of the mature 
HSA mRNA are indicated by circled 1 and 585 
respectively. Restriction enconuclease sites 
involved in the construction of pHSA1 are shown 
by vertical lines. An approximate size scale is 
nucleotides is included. 

The completed pJasmid pHSA1 is shown in fig. 
1B, with HSA coding regions derived from cDNA 
clones shaded as in A). Selected restriction sites 
and terminal codons number 1 and 585 are 
indicated as above. The £ coli trp promoter- 
operator region is shown with an arrow 
representing the direction of transcription. G:C 
denotes an oligo dG:dC tail. The leftmost Xbal 
site and the initiation codon ATG were added 
synthetically. The tetracycline (Tc) and ampicillin 
(Ap) resistance genes in the pBR322 portion f 
pHSA1 are indicated by a h avy line. 

Figur 2 depicts th immunoprecipitation f 
bacterially synthesized HSA. 

£ coli cells transformed with albumin express- 
ion plasmid pHSAI (lanes 4 and 5) or control 
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plasmid pLelFA25 (containing an interferon a gene 
in the identical expression vehicl ;lan s 2, 3 and 7) 
were grown in "S-methionine-supplemented 
media. Samples in lanes 2, 4 and 7 were induced 
for expression from the trp promoter in M9 media 
lacking tryptophan; samples in lanes 3 and 5 were 
grown in tryptophan-containing LB broth to rep- 
ress the trp promoter. Each sample lane of the 
autoradiograph of the SDS-poiyacrylamide gel 
presented here contains labeled protein 
immunoprecipitated from 0.75 ml of cells at a 
density of Agso=1. Lanes 1 and 6 contain radio- 
active protein standards (BRL) whose molecular 
weight in kilodaltons is indicated at the left 
Bacterially synthesized HSA is seen in lane 4 
comigrating with the 68,000 d 14 C-labeled bovine 
serum albumin standards. Increased production of 
serum albumin in the induced versus repressed 
culture of pHSA1 represents higher levels of 
synthesis of plasmid encoded protein ratherthan a 
difference in "S-methionine pool specific 
activities for minimal versus rich media (data not 
shown). The sharp band at 60,000 d is an apparent 
artifact; this band is seen in both induced and 
repressed pHSAI and control transforms nts, and 
binds to preimmune (lane 7) as well as anti-HSA 
IgGs (lanes 2—5). The minor 47,000 d band in lane 
4 is apparently plasmid encoded and may 
represent a prematurely terminated form of bacte- 
rially synthesized HSA. 

Figure 3 depicts the nucleotide and amino acid 
sequence of human serum albumin. 

The DNA sequence of the mature protein coding 
and 3' untranslated regions of HSA mRNA were 
determined from the recombinant plasmid 
pHSAI. The DNA sequence of the prepro peptide 
coding and 5' untranslated regions were deter- 
mined from the plasmid P-14 (see text). Predicted 
amino acids are included above the DNA sequence 
and are numbered from the first residue of the 
mature protein. The preceding 24 amino acids 
comprise the prepro peptide. The five amino acid 
residues which disagree with the protein sequence 
of HSA reported by both Dayhoff (9) and Moulon et 
aL (12) are underlined. The above nucleotide 
sequence probably does not extend to the true 5' 
terminus of HSA mRNA. In the albumin direct 
expression plasmid pHSAI, the mature protein 
coding region is immediately preceded by the £ 
coli trp promoter-operator-leader peptide ribo- 
some binding site (36, 37), an artificial Xbal site, 
and an artificial initiation codon ATG; the prepro 
region has been excised. The nucleotides preced- 
ing HSA codon No. 1 in pHSAI read 

5'*TCACGTAAAAAGGGTATCTAGATG. 

Detailed description 

(A) Synthesis and cloning of cDNA. 

Poly(A)+ RNAwas prepared from quickly frozen 
human liver samples btained by bi psy r from 
cadaver donors by either ribonucle side-vanadyl 
complex (17) r guanidinium thiocyanate (18) 
proc dures. cDNA reactions were performed 
essentially as described in (19) employing as 



primers either oligo-deoxynucleotides prepared 
by the phosph tri ester method (20) or oligo 
(dTl ia . 18 (C llaborative Research). For typical 
cDNA reactions 25—35 ug of poly(A)+ RNA and 

5 40—80 pmol of oligonucleotide primer were 
heated at 90° for 5 minutes in 50 mM NaCI. The 
reaction mixture was brought to final concen- 
trations of 20 mMTris HCI pH 8.3, 20 mM KCI, 8 mM 
MgCI 2 , 30 mM dithiothreitol, 1 mM dATP, dCTP, 

w dGTP, dTTP (plus "P-dCTP (Amersham) to follow 
recovery of product) and allowed to anneal at 42°C 
for 5'. 100 units of AMV reverse transcriptase (BRL) 
were added and incubation continued at 42° for 45 
minutes. Second strand DNA synthesis, SI treat- 

is ment, size selection on polyacrylamide gels, deoxy 
(C) tailing and annealing to pBR322 which was 
cleaved with Pstl and deoxy (G) tailed, were 
performed as previously described (21, 22). The 
annealed mixture was used to transform £ coli K- 

20 12 strain 294 (23) by a published procedure (24). 

(B) Screening of recombinant plasmids with 32 P- 
labelled Probes. 

E. coli transformants were grown on LB-agar 

25 plates containing 5 ng/ml tetracycline, transferred 
to nitrocellulose filter paper (Schleicher and 
Schuell, BA85) and tested by hybridization using a 
modification of the in situ colony screening proce- 
dure (25). 32 P-end labelled (26) ol igo deoxy nuc- 

30 leotide fragments of from 12 to 16 nucleotides in 
length were used as direct hybridization probes, or 
32 P-cDNA probes were synthesized from RNA 
using oligo (dT) or oligodeoxynucleotide primers 
(19). Filters were hybridized overnight in 5X 

35 Denhardfs solution (27), 5xSSC, <lxSSC=*1.5M 
NaCI, 0.15 M Na Citrate) 50 mM Na phosphate pH 
6.8, 20 ug/ml salmon sperm DNA at a temperatures 
ranging from 4° to 42° and washed in salt concen- 
trations varying from 1 to 0.2xSSC plus 0.1 

40 percent SDS at temperatures ranging from 4° to 
42° depending on the length of the 32 P-labelled 
probe (28). Dried filters were exposed to Kodak XR- 
2 X-ray film using DuPont Lightning-Plus intensify- 
ing screens at -80°. 

45 

(C) DNA preparation and restriction enzyme analy- 
sis. 

Plasmid DNA was prepared in either large scale 
(29) or small scale ("miniprep"; 30) quantities and 
so cleaved by restriction endonucleases (New Eng- 
land Biolabs, BRL) following manufacturers con- 
ditions. Slab gel electrophoresis conditions and 
electrocution of DNA fragments from gels have 
been described (31). 

55 

(D) DNA sequencing. 

DNA sequencing was accomplished by both the 
method of Maxam and Gilbert (26) utilizing end- 
labelled DNA fragments and by dideoxy chain 
60 termination (32) on single stranded DNA from 
phage M13 mP7 subclones (33) utilizing synthetic 
oligonucleotide (20) primers. Each region was 
independently sequenced several times. 

65 
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(E) Construction of 5' end of ablumin gene for 
direct expression of HSA. 

10 Mg (-16 pmol) of the -1200 bp Pstl insert of 
plasrnid F-47 was boiled in H 2 0 f r 5 minutes and 
combined with 100 pmoi of M P-end labelled 5' s 
pnmer (d ATG G ATG CACACAAG ). The mixture 
was quenched on ice and brought to a final 
volume of 120 pi of 6 mM Tris HCI pH 7.5, 6 mM 
MgCI 2 , 60 mM Nad, 0.5 mM dATP, dCTP, dGTP 
dTTP at 0°. 1 0 units of DNA polymerase i Klenow w 
fragment (Boehringer-Mannheim) were added 
and the mixture incubated at 24° for 5 nr. Follow- 
ing phenol/chloroform extraction, the product 
was digested with Hpall, electrophoresed in a 5 
percent polyacrylamide gel, and the desired 450 rs 
bp fragment electroeluted. the single stranded 
overhang produced by Xbal digestion of the 
vector plasrnid pLelF A25 (21) was filled in to 
produce blunt DNA ends by adding deoxynuc- 
leoside triphosphates to 10 pM and 10 units DNA 20 
polymerase I Klenow fragment to the restriction 
endonuclease reaction mix and incubating at 12° 
for 10 minutes. Restriction endonuclease frag- 
ments (0.1—1 pg in approximate molar equality) 
were annealed and ligated overnight at 12° in 20 25 

M M f c 5 r2r? M Tris HCI P H 7 ' 6 ' 10 m ^ 0.1 
mM EDTA, 5 mM dithiothreitol, 1 mM rATP with 
50 units T4 ligase (N.E. Biolabs). Further details of 
plasrnid construction are discussed below. 

(F) Protein analysis. 30 

Two ml cultures of recombinant £ colt strains 
were grown in either LB or M9 media plus 5 pg/ml 
tetracycline to densities of A sw =1.0, pelleted, 
washed repeated, and suspended in 2 ml of LB 35 
or supplemented M9 (M9+0.2 percent glucose, 1 
pg/ml thiamine, 20 |ig/ml standard amino acids 
except methionine was 2 pg/ml and tryptophan 
was excluded). Each growth medium also con- 
tained 5 ug/ml tetracycline and 100 uCi "S- 40 
methionine (NEN; 1200 Ci/mmol). After 1 hr incu- 
bation at 37°, bacteria was pelleted, freeze-thawed 
and resuspended in 200 fit 50 mM Tris HCI pH 7.5 
0.12 mM NaEDTA then placed on ice for 10 
minutes following subsequent additions of 45 
lysozyme to 1 mg/ml, NP40 0.2 percent and NaCI 

u J The lysate was ad J'usted to 10 mM 
MgCI a and incubated with 50 pg/ml DNase I 
(Worthington) on ice for 30 min. Insoluble 
material was removed by mild centrifugation. 50 

, were im munoprecipitated with rabbit 
anti-HSA (Cappel Labs) and staphylococcal absor. 
bent (Pansorbin; Cal Biochem) as described (34) 
and subjected to SDS polyacrylamide gel elec- 
trophoresis (35). y 

55 

(G) cDNA cloning. 

Initial cDNA clones primed with aligo (dT) were 
screened by colony hybridization with both total 
liver cDNA (to identify abundant RNA species so 
containing clones) and with tw ^-labelled 
cDNAs primed from liver mRNA by two sets of 
four 11 base oligodeoxynucl tides synthesized 
to represent the possible coding variations for 
amino adds 546-549 and 294—297 f HSA. ss 



10 



So !tL,°',« e 5 . neV6r contai "«>d more than 
aboutthe3'V2 fthepr teinc dingr gion fthe 
expected HSA mRNA sequence. {The longest of 
these recombinants was designated B-44). Since 
existing procedures were unable to directly copy 
an mRNA of the expected size (-2000 bp), syn- 
thetic oligodeoxynucleotides were prepared to 
correspond to the antimessage strand at regions 
near the 5' extreme of B-44. From the nucleotide 
sequence of B-44, we constructed a 12 base 
2i£ ^S^ e °Tu de corres P°^ing to amino 
acids 369-373. This was used to prime cDNA 
synthesis of liver mRNA and produce cDNA clo- 
nes i in pBR322 containing the 5' portion of the 
HSA message while overlapping the existing B-44 
recombinant. Approximately 400 resulting clones 
were screened by colony hybridization with a 16 
base oligodeoxynucleotide fragment located 
slightly upstream in the mRNA sequence we had 
thus far determined. Approximately 40 percent of 
the colonies hybridized to both probes. Many of 
those colonies which failed to contain hybridizing 
plasmids presumably resulted from RNA self- 
priming or priming with contaminating oligo (dT) 
during reverse transcription, or lost the 3' region 
containing the sequence used for screening. 
Mmiprep amounts of plasrnid DNA from hyb- 
ridizing colonies were digested with Three 
recombinant plasmids contained sufficiently 

' a /Sf ulT 8 10 00018 f0r the raining 5' portion 
of the HSA message. Two of these (F-1 5 and F-47) 
contained the extreme 5' coding portion of the 
gene but failed to extend back to a Pstl site 
necessary for joining with B-44 to reform the 
complete albumin gene. Recombinant F-61 
possessed this site but lacked the 5' extreme end. 
A three part reconstruction of the entire message 
sequence was possible employing restriction 
endonuclease sites in common with the part 
length clones F-47, F-61 and B-44 (Fig. 1). 

An additional cDNA clone extending further 5' 
was obtained by similar oligodeoxynucleotide 
primed cDNA synthesis (from a primer corre- 
sponding to amino acid codon No. 175—179), 
Although not employed in the construction of the 
mature HSA expression plasrnid, this cDNA clone 
(P-14) allowed determination of the DNA 
sequence of the "prepro" peptide coding and 5' 
non-coding regions of the HSA mRNA. 

The mature HSA mRNA sequence was joined to 
a vector plasrnid for direct expression of the 
mature protein in £ colt via the tip promoter- 
operator. The plasrnid pLelF A25 directs the 

SS^f? 1 '?" 0f human ,eukocvt e interferon A 
(lFNo2) (21). It was digested with Xbal and the 
cleavage site "filled in" to produce blunt DNA 
ends with DNA polymerase I Klenow fragment 
and deoxynucleoside triphosphates. After subse- 
quent digestion with Psfl, a "vector" fragment 
was gel purified that contained pBR322 
sequences and a 300 bp fragment f the £ co//trp 
promot r, perator, and rib some binding site of 
th trp leader peptide terminating in the artifi- 
cially blunt ended 2*bs! cleavage site. A 15 base 
"g de xynucle tide was designed t c ntain 
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the initiation codon ATG followed by the 12 
nucte tides c ding f r the first f ur amin acids 
of HSA as determined by DNA sequence analysis 
of clone F-47. In a process referred to as "primer 
repair", the gene-containing Pstl fragment F-47 
was denatured, annealed with excess 15-mer and 
reacted with DNA polymerase I Klenow fragment 
and deoxynucleoside triphosphates. This reaction 
extends a new second strand downstream from 
the annealed oligonucleotide, degrades the single 
stranded DNA upstream of codon number one 
and then polymerizes upstream three nucleotides 
complementary to ATG. In addition, when this 
product is blunt-end ligated to the prepared vec- 
tor fragment, its initial adenosine residue 
recreates an Xbal restriction site. Following the 
primer repair reaction, the DNA was digested with 
Hpall and a 450 bp fragment containing the 5' 
portion of the mature albumin gene was gel 
purified (see Rg. 1). This fragment was annealed 
and ligated to the vector fragment and to the gel 
isolated Hpall to fsll portion of F-47 and used to 
transform £ coli cells. Diagnostic restriction 
endonuclease digests of plasmid minipreps iden- 
tified the recombinant A-26 which contained the 
5' portion of the mature albumin coding region 
ligated properly to the trp promoter-operator. For 
the final steps fn assembly, the A-26 plasmid was 
digested with BgPII plus Pstl and the -4 kb 
fragment was gel purified. This was annealed and 
ligated to a 390 bp Pstl, Bglll partial digestion 
fragment purified from F-61 and a 1000 bp Pstl 
fragment of B-44. Restriction endonuclease analy- 
sis of resulting transformants identified plasmids 
containing the entire HSA coding sequence 
properly aligned for direct expression of the 
mature protein. One such recombinant plasmid 
was designated pHSA1. When £ coli containing 
pHSA1 is grown in minimal media lacking tryp- 
tophan, the cells produce a protein which speci- 
fically reacts with HSA antibodies and comigrates 
with HSA in SDS polyacrylamide electrophoresis 
(Rg. 2). No such protein is produced by identical 
recombinants grown in rich broth, implying that 
production in £ coli of the putative HSA protein is 
under control of the trp promoter-operator as 
designed. To insure the integrity of the HSA 
structural gene in the recombinant plasmid, 
pHSA1 was subject to DNA sequence analysis. 

(H) DNA sequence analysis 

The albumin cDNA portion (and surrounding 
regions) of pHSA1 were sequenced to completion 
by both the chemical degradation method of 
Maxam and Giblert (26) and the dideoxy chain 
termination procedure employing templates 
derived from single stranded M13 mP7 phage 
derivatives (32, 33). All nucleotides were 
sequenced at least twice. The DNA sequence is 
shown in Rg. 3 along with the predicted amino 
acid sequenc of the HSA protein. Th DNA 
sequence farther 5' to the mature HSA coding 
region was also determined from the cDNA clone 
P-14 and is Included in Rg. 3. 

DNA sequence analysis confirmed that the arti- 



ficial initiation codon and the complete mature 
HSA coding sequence directly follows the £ coli 
trp promoter-operator as desired. The ATG 
initiat r f Hows the putative £ coli ribosome 
5 binding sequence (36) of the trp leader peptide 
(37) by 9 nucleotides. 

Translation of the DNA sequence of pHSA1 
predicts a mature HSA protein of 585 amino acids. 
Various published protein sequences of HSA dis- 
10 agree at about 20 amino acids. The present 
sequence differs by eleven residues from Mouton 
et at. (12), and by 28 residues from that reported 
in the Dayhoff catalogue (9) credited as arising 
primarily from Behrens et al. (10) with contribu- 
te tions by Moulon ef al. (12). Most of these 
differences represent inversions of pairs of adja- 
cent residues or glutamine-glutamic acid disag- 
reements. Only at five of the 585 residues does 
our sequence differ from the residue reported by 
20 both Dayhoff (9) and Moulon et al. (12), and three 
of these five differences represent glutamine- 
glutamic acid interchanges (underlined in Figure 
3). At all discrepant positions the nucleotide 
sequencing has been carefully rechecked and it is 
25 unlikely that DNA sequencing errors are the cause 
of these reported differences. The possibility of 
artifacts introduced by cDNA cloning cannot be 
ruled out. However, other likely explanations exist 
for the amino acid sequence differences among 
30 various reports. These include changes in amida- 
tion (affecting glutamine-glutamic acid discrimi- 
nation) occurring either in vivo or during protein 
sequencing (38). Polymorphism in HSA proteins 
may also account for some differences; over 
as twenty genetic variants of HSA have been 
detected by protein electrophoresis (39) but have 
not yet been analyzed at the amino acid sequence 
level. It is also worth noting that our predicted 
HSA protein sequence is 585 amino acids long, in 
40 agreement with Moulon (12) but not Dayhoff (9). 
The difference is accounted for by the deletion (in 
ref. 9) of one phenylalanine (Phe) residue in a Phe- 
Phe pair at amino acids 156—157. 
When compared to the DNA sequence of a rat 
45 serum albumin cDNA CLONE (16) the present 
mature HSA sequence shares 74 percent homol- 
ogy at the nucleotide and 73 percent homology at 
the amine- acid level. (The rat SA protein is one 
amino acid shorter than HSA; the car boxy ter- 
se? minal residue of HSA is absent in the rat protein.) 
All 35 cysteine residues are located in identical 
positions in both proteins. The predicted "pre- 
pro" peptide region of HSA shares 76 percent 
nucleotide and 75 percent amino acid homology 
55 with that reported from the rat cDNA clone (16). 
Interspecies sequence homology is reduced in the 
portion of the 3' untranslated region which can be 
compared (the published rat cDNA clone ends 
before the 3' mRNA terminus). The HSA cDNA 
60 contains the hexanucleotide AATAAA 28 nuc- 
le tides before the site f poly (A) additi n. This is 
a comm n feature of eukary tic mRNAs first 
noted by Proudfoot and Br wnlee (40). 

65 
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Pharmaceutical compositi ns 

The comp unds of the present invention can be 
formulated according to kn wn methods t pre- 
pare pharmaceutical^ us ful comp sitions, 
whereby the polypeptide hereof is combined in 
admixture with a pharmaceutically acceptable 
carrier vehicle. Suitable vehicles and their formu- 
lation are described in Remington's Pharmaceuti- 
cal Sciences by E. W. Martin, which is hereby 
incorporated by reference. Such compositions 
will contain an effective amount of protein hereof 
together with a suitable amount of vehicle in 
order to prepare pharmaceutically acceptable 
compositions suitable for effective administration 
to the host One preferred mode of administration 
is parenteral. 
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?B^ 8 ^M??J? n ? 8Ctin9States: BE, CH, DE, FR, 
Go, IT, U, LU f NL SE 

1. A DNA isolate comprising a continuous 
sequence encoding human serum albumin of the 
amino acid sequence depicted in Fig. 3 hereof and 
genetic variants thereof. 

2. An expression vehicle comprising a DNA 
coding sequence according to claim 1 operably 
linked with a DNA vector capable of effecting the 
microbial expression of said sequence so as to 
prepare the corresponding human serum albu- 
min. 

3. A microorganism transformed with the 
vehicle according to claim 2. 

4. The microorganism according to claim 3, 
obtained by transforming an £ coli bacterial or a 
yeast strain. 

5. A process which comprises microbially 
expressing human serum albumin of the amino 
acid sequence depicted in Fig. 3 hereof and 
genetic variants thereof. 

6. A process which comprises expressing 
human serum albumin in a transformed microor- 
ganism of claim 3 or daim 4. 

7. A process of claim 5 or claim 6 which further 
includes using the human serum albumin to 
prepare a pharmaceutical composition for the 
therapeutic treatment of humans. 

Claims for the Contracting State: AT 

1. A method which comprises the preparation 
of a DNA isolate comprising a continuous 
sequence encoding human serum albumin of the 
amino acid sequence depicted in Fig. 3 hereof and 
genetic variants thereof. 

2. A method which comprises the preparation 
of an expression vehicle comprising a DNA cod- 
ing sequence according to claim 1 operably linked 
with a DNA vector capable of effecting the 
microbial expression of said sequence so as to 
prepare the corresponding human serum albu- 
min. 

3. A microorganism transformed with the 
expression vehicle as defined in claim 2. 

4. The microorganism according to claim 3 
obtained by transforming an £ coli bacterial or a 
yeast strain. 

5. A process which comprises microbially 
expressing human serum albumin of the amino 
acid sequence depicted in Fig. 3 hereof and 
genetic variants thereof. 

6. A process which comprises expressing 
human serum albumin in a transformed microor- 
ganism of claim 3 or claim 4. 

7. A process f claim 4 or daim 6 which further 
includes using the human serum albumin t 
prepare a pharmaceutical comp siti n f r the 
therapeutic treatment of humans. 
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Paterrtanspruche fur die Vertragsstaaten: BE CH 
DE FR GB IT U LU NL SE 

1. DNA-lsolat umfassend eine kontinuierliche 
s Sequenz kodierend fur menschliches Serum-Alb- 
umin der in Fig. 3 der voriiegenden Anmeldung 
dargestellten Aminosauresequenz und geneti- 
sche Varianten desselben. 

2. Expressionsvehikel umfassend eine DNA- 
to Kodierungsequenz nach Anspruch 1 , die betriebs- 

fahig an einen DNA-Vektor gebunden ist, der 
fahig ist, die mikrobielle Expression der ge- 
nannten Sequenz zu bewirken, urn das ent- 
sprechende menschliche Serum-Albumin her- 
15 zustellen. 

3. Mikroorganismus, der mit den Vehikel nach 
Anspruch 2 transformiert ist. 

4. Mikroorganismus nach Anspruch 3, der 
durch Transformieren eines E. coli Bakterien- 

20 oder eines Hefestammes erhalten wird. 

5. Verfahren umfassend die mikrobielle Expres- 
sion von menschlichem Serum-Albumin mit der 
in Fig. 3 der voriiegenden Anmeldung dargestell- 
ten Aminosauresequenz und genetische Varian- 
ts ten desselben. 

6. Verfahren umfassend die Expression von 
menschlichem Serum-Albumin in einem transfor- 
mierten Mikroorganismus nach Anspruch 3 Oder 
4. 

30 7. Verfahren nach Anspruch 5 Oder 6, das 
welters die Verwendung des menschlichen 
Serum-Albumins zur Herstellung einer pharma- 
zeutischen Zusammensetzung fur die therapeuti- 
sche Behandlung des Menschen umfaSt. 

35 

Patentanspruche fflr den Vertagsstaat: AT 

rJ; Verfahren umfassen <J die Herstellung eines 
DNA-isolates, das eine kontinuierliche Sequenz 
*o kodierend fOr menschliches Serum-Albumin der 
m Fig. 3 der voriiegenden Anmeldung dargestell- 
ten Aminosauresequenz und genetische Varian- 
ten desselben umfaSt. 

2. Verfahren umfassend die Herstellung eines 
•45 Expressionsvehikels, das eine fur DNA kodie- 

rende Sequenz nach Anspruch 1 umfaSt, die 
betriebsfahig mit einem DNA-Vektor verbunden 
ist, der fahig ist, die mikrobielle Expression der 
genannten Sequenz zu bewirken, urn das entspre- 
so chende menschliche Serum-Albumin herzustel- 
len. 

3. Mikroorganismus, der mit dem in Anspruch 2 
definierten Expressionsvehikel transformiert ist 

4. Mikroorganismus nach Anspruch 3, der 
ss durch Transformieren eines E. coli-Bakterien- 

oder eines Hefestammes erhalten wird. 

5. Verfahren umfassend die mikrobielle Expres- 
sion von menschlichem Serum-Albumin der in 
Fig. 3 der voriiegenden Anmeldung dargestellten 

60 Aminosauresequenz und genetischer Varianten 
desselben. 

6. Verfahren umfassend die Expression v n 
menschlichem Serum-Albumin in einem transf r- 
mterten Mikro rganismus nach Anspruch 3 oder 

65 4. 
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7. Verfahren nach Anspruch 5 oder 6, waiters 
umfassend die Verw ndung des menschlichen 
Serum-Albumins zur Herstellung einer pharma- 
zeutischen Zusammensetzung fur die therapeuti- 
sche Behandlung des Menschen. 

Revendicatlons pour les Etats contractants: BE 
CH DE FR GB IT U LU NL SE 

1. Produit isote d'ADN comprenant une 
sequence continue codant pour la serum-albu- 
mine humaine de la sequence d'acides amines 
presentee a la figure 3 et ses variants gen&iques. 

2. V6hicule d'expression comprenant une 
sequence d'ADN de codage selon la revendica- 
tion 1 Ii6 de fagon op^ratoire avec un vecteur 
d'ADN capable d'accomplir I'expression micro- 
bioiogique de ladite sequence de fagon a preparer 
(a serum-albumine humaine correspondante. 

3. Micro-organisme transform^ avec la v^hicule 
selon la revendication 2. 

. 4. Micro-organisme selon la revendication 3, 
obtenu par transformation d'une souche bact6- 
rienne de £ coll ou d'une souche de levure. 

5. Proced$ qui comprend I'expression micro- 
biologique de la sirum-albumine humaine de la 
sequence d'acides amines presentee a la figure 3 
et de ses variants g£netiques. 

6. Procdde* qui comprend I'expression de la 
s6rum-albumine humaine dans un micro-orga- 
nisme transform^ selon la revendication 3 ou la 
revendication 4, 

7. Proc6d6 selon la revendication 5 ou la reven- 
dication 6, qui comprend en outre I'utilisation de 
la serum-albumine humaine pour preparer une 
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compositi n pharmaceutique pour le traitement 
therapeutique des humains. 

Revindications pour I'Etat contractant: AT 

5 

1. Proc6d6 qui comprend la preparation d'un 
produit isole d'ADN comprenant une sequence 
continue codant pour la serum-albumine 
humaine de la sequence d'acides amines presen- 
ce tee a la figure 3 et ses variants gSnetiques. 

2. Proc§d§ qui comprend la preparation d'un 
v§hicule d'expression comprenant une sequence 
d'ADN de codage selon la revendication 1 lie" de 
facon opeVatoire a un vecteur d'ADN capable 

fs d'effectuer I'expression microbiologique de ladite 
sequence de fagon a preparer la serum-albumine 
correspondante. 

3. Micro-organisme transform^ avec le vdhicule 
d'expression tel que difini a la revendication 2. 

20 4. Micro-organism selon la revendication 3, 
obtenu par transformation d'une souche bact6- 
rienne de £ co/i ou d'une souche de levure. 

5. ProcSde qui comprend ('expression micro- 
biologique de la serum-albumine humaine de la 

25 sequence d'acides amines representee a la figure 
3 et de ses variants gene>ique. 

6. Proc6d6 qui comprend ('expression de la 
serum-albumine humaine dans un micro-orga- 
nisme transform^ selon fa revendication 3 ou la 

30 revendication 4. 

7. Proc6d6 selon la revendication 5 ou la reven- 
dication 6 qui comprend en outre I'utilisation de la 
serum-albumine humaine pour preparer une 
composition pharmaceutique pour le traitement 

35 therapeutique des humains. 
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S£ Tr * v * 1 Thr Pht lit S«r Itu Itu fttt Uu Pht Str W Ala Tvr W Am «« v.i *_ 
ACA ATS AAfi TGfi 6TA ACC TTT ATT TO CTT CTT TTT CTC TTT AGC TC6 GCT TAT TCC Afil 06? GTG TTT CG? CgZ 



MNittff*) 
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»&3M£Satt»aaS£&i£SSaK3a»£ 

£M»&aaa$2&%a*as&asas*&s 

»&%%8!aaa«&aasaji?aaKaaaasaS 
ssssftaas&xa&a&Ka&a&aaa&asi 

%£&&ffis&aKaaaa&»Kssaaa&a£2 
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Vll All A1« Str Gin Al« All Uv Gly Ltu tod 

GTT OCT GCA ACT CAA GCT GCC TTA GGC TTA TAA CATCTACATTTAAAAGCATC TUlCCCTACCATGAGMTAAfiAGAAAAAMATGAA 

GATCAAAAttnATTCATCTTJTTTTCTTTTTCGTTW 

GTtXTTUAnUTAAAAMTGGAAAfiAA^ 

TGGAAfiTKCAfiTGTTC TCTCTTATTC CACTTCK ^ 1y(A , 
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